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DETAILED ACTION 

Election/Restrictions 

1. Applicant's election with traverse of Group I, claims 1-7, and new claims 21-23 in 
the reply filed on 1 1/13/06 is acknowledged. Applicant further elected the species 
FRG1 . The traversal is on the grounds that the generic claims to 4q35 region of 
chromosome 4 encompass the elected species. This is found persuasive. 

The requirement is still deemed proper and is therefore made FINAL. 

Priority 

2. Priority is to the provisional application 60/418,024, filed 10/11/02. 

Sequence Compliance 
This application contains sequence disclosures that are encompassed by the 
definitions for nucleotide and /or amino acid sequences set forth in 37 CFR 1821(a)(1) 
and (a)(2). However, this application fails to comply with the requirements of 37 CFR 
1.821 through 1.825 because 37 CFR 1.821 (a)(2)(c-d) states that each sequence 
disclosed must appear separately in the sequence listing and in the text of the 
description and claims whenever described. For example, a SEQ ID NO: and sequence 
listing is required for the nucleic acid sequence shown in Figure 2C of the drawings. For 
instance, the SEQ ID NO: may be referenced in the "Brief Description of Drawings." 
See MPEP 2422 & 2431 . This is not meant to be an exhaustive list of places where the 
specification fails to comply with th sequence rules. The specification has not been 
checked to the extent necessary to determine th epresence of all possible mior errors. 
Applicant's cooperation is requested in correcting any errors of which applicant may 
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become aware in the specification. The application cannot issue unless it is in 
compliance. Correction is required. 

Information Disclosure Statement 

3. The IDS' of 12/02/04 and 9/16/05 have been considered by the Examiner. 

Claim Rejections - 35 USC §112 
The following is a quotation of the first paragraph of 35 U.S.C. 112: 

The specification shall contain a written description of the invention, and of the manner and process of 
making and using it, in such full, clear, concise, and exact terms as to enable any person skilled in the 
art to which it pertains, or with which it is most nearly connected, to make and use the same and shall 
set forth the best mode contemplated by the inventor of carrying out his invention. 

4. Claims 1-7, 10-11 and 21-23 are rejected under 35 U.S.C. 112, first paragraph, 

because the specification, while being enabling for providing a D4Z4 binding element 

with a specific sequence localized in region 4q35 of human chromosome 4, contacting 

the D4Z4 binding element with a test compound and determining whether the test 

compound interacts with the D4Z4 binding element, does not reasonably provide 

enablement for providing "a D4Z4 binding element," contacting the D4Z4 binding 

element with a test compound and determining whether the test compound interacts 

with the D4Z4. The specification does not enable any person skilled in the art to which 

it pertains, or with which it is most nearly connected, to make or use the invention 

commensurate in scope with these claims. Further, the specification is not enabling for 

determining the level of expression of recited genes, generic genes or named. 

The factors considered when determining if the disclosure satisfies the 
enablement requirement and whether any necessary experimentation is "undue" 
include, but are not limited to: 
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1) nature of the invention, 2) state of the prior art, 3) relative skill of those in the art, 4) 

level of predictability in the art, 5) existence of working examples, 6) breadth of claims, 

7) amount of direction or guidance by the inventor, and 8) quantity of experimentation 

needed to make or use the invention. In re Wands, 858 F.2d 731, 737, 8 USPQ2d 1400, 

1404 (Fed. Cir. 1988). 

The invention is highly complex because it is not known just what constitutes "a 

D4Z4 binding element" from the specification or the prior art. The specification provides 

a circular, open-ended, structurally open definition on page 3, wherein "providing a 

D4Z4 binding element, e.g., DNA comprising one or more D4Z4 repeats," constitutes 

the only limitation to the DNA structure. Also, the specification teaches that, "D4Z4 is 

highly polymorphic with a variable number tandem repeat (VNTR) structure D4Z4." This 

definition imports little structural limitation to the D4Z4 binding element. Further, the 

specification discloses a 27 bp "minimal binding element." However, it is not known if 

the D4Z4 element comprises one or some defined number of exact repeats of this 

element, or whether this "minimal binding element" is loosely repeated in a D4Z4 

binding element. It is not known from the specification what constitutes the structural 

limits of a D4Z4 binding element, nor what are the essential features of the D4z4 

binding element. Therefore one skilled in the art would not recognize a D4Z4 binding 

element as recited in the claims and defined in the specification. 

The prior art recognizes species variations in the sequences of the D4Z4 repeats 

(see Winokur et al., 1996), as well as D4Z4 repeats on all acrocentric chromosomes 

which include chromosomes 14, 15, 21 and 22. A blast analysis of a large fragment of 

the d4z4 repeat unit (accession D38024; appendix a; also see Lyle et al, 1995) shows 

divergence from the sequence in chromosome 10 (appendix b), indicating that 1) the 
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sequences are different, and 2) D4Z4 domains on the other acrosomal chromosomes 
listed above are divergent. Further, a blast of. multiple copies of the "minimal binding 
element," as it relates to claim 23 for instance, against human genomic sequence 
yielded no sequence homology to anything. Therefore, at best the art and the 
specification give only an open-ended definition of D4Z4 that lacks structural 
boundaries. The art and the specification recognize an open ended, wide range of 
deletions or rearrangements, or repeats as, as well as plainly describing it as a highly 
variable region. Furthermore, claims 2-7 are directed to a D4Z4 binding element in a 
cell that expresses 4Q35, which does not mean the D4Z4 element must be the one 
localized on 4q35. Thus, the method measures the binding to elements that have 
nothing to do with 4q35, and nothing to do with expression of genes in that region. 

The art (in this case, post filing date) indicates many other genes in the region of 
4q35 (Blair et al, 2005) than are described in the disclosure or in the prior art. See 
Table I, Blair et al. Thus, the specification could not have been enabling for a generic 
"4q35 gene" at the time of filing. The claims also recite measuring expression of FSHD 
region genes, such as FRG1, etc. However there is no description of such genes, their 
introns/exons, splice sites, promoter regions, and spice forms. Therefore one skilled in 
the art would not know what is being measured nor how to measure it. 

Given the variance across species, the existence of multiple D4Z4 regions 
throughout the genome, and the lack of apparent homology to the putative minimal 
binding element, what constitutes a D4Z4 domain is highly unpredictable. 
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The breadth of the claims is such that they encompass D4Z4 regions 
unrecognizable to one skilled in the art. 

Given the complex nature of the invention, the unpredictability of the art, the lack 
of adequate teachings or examples in the specification, and the breadth of the claims, 
one skilled in the art would not know how to make or use the invention as claimed. 

5. Claims 1-7, 10-11 and 21-23 rejected under 35 U.S.C. 112, first paragraph, as 
failing to comply with the written description requirement. The claim(s) contains subject 
matter which was not described in the specification in such a way as to reasonably 
convey to one skilled in the relevant art that the inventor(s), at the time the application 
was filed, had possession of the claimed invention. 

Applicant claims a method of identifying compounds by using a generic 
D4Z4 binding element, a generic 4q35 gene, and FRG1, FRG2, and ANT1 genes. 
However, the specification lacks a closed structural definition of such things. Moreover, 
these elements are not known in the art (as demonstrated above). One skilled in the art 
would not recognize such. No written description is provided in the instant specification 
as to what structurally constitutes nucleotide sequences "...comprising D4Z4 binding 
elements or 4q35 genes", the boundaries and functional elements of which would be 
unknown to one skilled in the art at the time the invention was made. The specification 
has not described, nor can it be reasonably visualized by one skilled in the art, the 
structural and functional elements attributable to chromosome 4, which represents the 
boundaries of the nucleotide sequences as currently claimed. For example, 
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chromosome 4 contains other undescribed promoters, coding regions, 5' and 3' flanking 
regions, exons and introns, each with their own unique structure and function that is not 
described in the instant specification. In short, a D4Z4 binding element and 4q35 
region, as written, are readable on the whole of chromosome 4, as well as other 
chromosomes. FRG1, FRG2, and ANT1 genes have undescribed promoters, coding 
regions, 5' and 3' flanking regions, introns and exons, each with their own unique 
structure and function that is not described in the specification. 

Vas-Cath Inc. v. Mahurkar, 19USPQ2d 1111, clearly states "applicant must 
convey with reasonable clarity to those skilled in the art that, as of the filing date sought, 
he or she was in possession of the invention. The invention is, for purposes of the 
'written description' inquiry, whatever is now claimed." (See page 1117.) The 
specification does not "clearly allow persons of ordinary skill in the art to recognize that 
[he or she] invented what is claimed." (See Vas-Cath at page 1116). As discussed 
above, the skilled artisan cannot envision the detailed chemical structure of the 
encompassed genus of structures claimed, and therefore conception is not achieved 
until reduction to practice has occurred, regardless of the complexity or simplicity of the 
method of isolation. Adequate written description requires more than a mere statement 
that it is part of the invention and reference to a potential method of isolating it. The 
compound itself is required. See Fiers v. Revel, 25 USPQ2d 1601 at 1606 (CMC 1993) 
and Amgen Inc. v. Chugai Pharmaceutical Co. Ltd., 18 USPQ2d 1016. 



Claim Rejections - 35 USC § 102 
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5. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 
A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed publication in this or a foreign country or in public 
use or on sale in this country, more than one year prior to the date of application for patent in the United 
States. 

Claims 1-2, and 7 are rejected under 35 U.S.C. 102(b) as being anticipated by 
Winokuretal(1996). 

Winokur et al provides a D4Z4 binding element (which is in a human 
lymphoblast; Figure 1, page 1568), contacts that element with DNA that hybridizes to 
the end of chromosme 4 (cosmid 88F8), which is contiguous with the D4Z4 binding 
element (see Figure 1). Thus, the test compound (DNA) interacts with the D4Z4 binding 
element. Steps (a)-(c) are accomplished by Winokur et al. Step (d) imports no 
limitations on the compound selected (i.e., the compound could be any compound); and 
step (d) necessarily flows from step (c) when the compound interacts with D4Z4. In 
other words, step (d) is inherently accomplished with the practice of steps (a)-(C) 
because "selecting the test compound as a candidate..." is accomplished when it is 
determined that a test compound interacts with D4Z4. 



Conclusion 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Steven Standley whose telephone number is (571) 272- 
3432. The examiner can normally be reached on Monday through Friday, 8:00 AM to 
5:00 PM. If attempts to reach the examiner by telephone are unsuccessful, the 
examiner's supervisor, Janet Andres can be reached on (571) 272-0867. 
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The fax number for the organization where this application or proceeding is assigned 
is 703-872-9306. 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published 
applications may be obtained from either Private PAIR or Public PAIR. Status 
information for unpublished applications is available through Private PAIR only. For 
more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). 



Ste/e, Standley, Ph.D. 
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AUTHORS 
TITLE 



JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



repeat unit 



HUMFSHD 3303 bp DNA linear PRI 29-MAY-2002 

Homo sapiens FSHD gene for facioscapulohumeral muscular dystrophy, 
complete cds, 4Z4 tandem repeat unit. 
D38024 

D38024.1 GI:871846 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 

1 (bases 1 to 3303) 

Lee,J.H., GotO/K., Matsuda,C. and Arahata,K. 

Characterization of a tandemly repeated 3.3-kb Kpnl unit in the 

facioscapulohumeral muscular dystrophy (FSHD) gene region on 

chromosome 4q35 

Muscle Nerve 2, S6-S13 (1995) 

7739628 

2 (bases 1 to 3303) 
Lee, J. 

Direct Submission 

Submitted (22-AUG-1994 ) Je Hyeon Lee, National Institute of 
Neuroscience, NCNP, Department of Neuromusclar Research; 4-1-1- 
Ogawa-higashi, Kodaira, Tokyo 187, Japan (Tel : 0423-4 6-1712, 
Fax:0423-46-1742) 

Location/Qualifiers 

1. .3303 

/organism="Homo sapiens" 
/mol_type="genomic DNA" 
/db_xref -"taxon : 9606 " 
/chromosome=" 4 " 
/map="4q35-qter" 
/clone="c51" 
1. .3303 

/note="3303bps Kpnl fragment in FSHD gene region contains 
other repeat and sequence motive" 
/ f unct ion= " unknown " 
/ rpt_f ami 1 y= " D4 Z 4 " 
/rpt_t ype=tandem 
1..300 

/note="region with similarity to Lsau(GenBank X59423) , 
Part of tandem repeat D4Z4" 
/f unct ion= "unknown" 



repeat region 



http://www.ncbi.nlm.nih.gov/entrez/viewer.fcgi?db=nuccore&vaI=871846 
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/rpt_type=other 
misc feature 393.. 578 

/note="extremly G-rich region in 186bps. Part of tandem 

repeat. locus D4Z4" 

/function=" unknown" 
satellite 398.. 578 

/note="microsatellite of GGAGG, Part of tantem repeat 

locus D4Z4" 

/rpt_type=direct 
gene 417. .2978 

/gene="FSHD" 
CDS 417. .2978 

/gene= " FSHD" 

/note="ORF" 

/codon_start=l 

/product="facioscapulohumeral muscular dystrophy" 
/protein id=" BAA07227 . 1 " 
/db_xref="GI : 14 35038" 

/trans lation="MERGTGETRGAEGTLGGRQGGREAGRNGGRDRATQGLGAGPREP 
GTDGGRKAGRKSGPRPPGVAGPPASGKTVSVRRGLRAGPTAAAPAGGAPPIRPGSGAQ 
GVGGFLRDKRPGLGLPSGLHPRGSQTAHPQAEPCNAARGPQTRPRRSHTQDDGGVILV 
SEWLCPPEGGLLLTSLRPPKGWPCRLFAPGALRHPETCREGCKPGMVPSLSLPGSKPA 
TLQTPPRCRTRESIVRPSRRGGISSLGSRSGLLRGNEREPHACVCETVPATATPTGIA 
SFTERGPGTLKTPTEVQFHTPLHPPRLVSPCCRRVGAQRAASRSRGIPGEVRRAGPRN 
APPSPLPPLPLPLRLSGPTTTTATTPPPPPPPPTTTTTTTPPAGPRPRRPGSLPGWGG 
LSQGGSPPFMKGWSLPACGPLQGRLAGWLAVRAGLLAAPAAVHSPAEVHGSPPASLCP 
RPSVKFRPGLTAMALPTPSDSTLPAEARGRGRPRRLVWTPSQSEALRACFERNPYPGI 
ATRERLAQAIGIPEPRVQIWFQNERSRQLRQHRRESRPWPGRRGPPEGRRKRTAVTGS 
QTALLLRAFEKDRFPGIAAREELARETGLPESRIQIWFQNRRARHPGQGGRAPAQAGG 
LCSAAPGGGHPAPSWVAFAHTGAWGTGLPAPHVPCAPGALPQGAFVSQAARAAPALQP 
SQAAPAEGVSQPAPARGDFAYAAPAPPEPGRSPTLRLLGGLRTRAKAGRTGTRSATAC 
RAPARWHSLGPLKRGRRPRGACATHVPGESVVGLGPGSPGRRGGVGTPSRGSSTSPAR 
APGTPPPPRGRGRCKASRRPPRRSRSRRPGLHSPAACCWMSSWRARSFCSRRNLS" 
misc_f eature 1405. .1554 

/gene="FSHD" 

/note="Extremly C-rich region (69%) in 186bps. Part of 
tandem repeat locus 4DZ4." 
/function=" unknown" 
satellite 1470. .1536 

/gene-"FSHD" 

/note="microsatellite of CCA, Part of tandem repeat locus 
D4Z4" 

/rpt_type=direct 
satellite 1590. .1703 

/gene="FSHD" 

/note="microsatellite of GGCT, Part of tandem repeat locus 
D4Z4" 

/rpt_type=direct 
misc feature 1863.. 2037 

/gene="FSHD" 

/note="paired type homeodomain seq I, 

translation; PRRLVWTPSQSEALRACFERNPYPGIATRERLAQAIGIPEPRVQIW 
FQNERSRQLR" 
/function= "unknown" 
misc feature 2082.. 2264 

/gene="FSHD" 

/note="paired type homeodomain seq II, 

trans la tionGRRKRTAVTGSQTALLLRAFEKDRFPGIAAREELARETGLPESRIQI 

WFQNRRARH PGQG " 

/ f un c t i on= " un known " 



http://www.ncbi.nlm.nih.gov/entrez/viewer.fcgi?db=nuccore&val= 
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ORIGIN 

1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 
2281 
2341 
2401 
2461 
2521 
2581 
2641 
2701 
2761 
2821 
2881 
2941 
3001 
3061 
3121 
3181 
3241 
3301 

// 



ggtaccagca 
gtgggagcag 
cgccccctcc 
ggagaccccc 
gccggctcgc 
gggcccctgc 
gcccggcgca 
agagaggaac 
gggaggcagg 
gggagccggg 
gagtagcggg 
ggcccacagc 
agggagtggg 
acccgcgcgg 
cacagacccg 
agtggctttg 
ggccatgccg 
gcaagcccgg 
ctcccccacg 
cctctctggg 
tgtgcgagac 
ggcctggcac 
ggctggtttc 
ggatccctgg 
ctctccccct 
caccaccccc 
gacgccctgg 
tgaaggggtg 
ctgtccgggc 
ggagcccgcc 
cgatggccct 
ggccacggag 
ggaacccgta 
agcccagggt 
gggaatctcg 
ccgtcaccgg 
gcatcgccgc 
tctggtttca 
caggcggcct 
tcgcccacac 
gggctctccc 
ccagccaggc 
cctacgccgc 
gcctccgcac 
cccctgcgcg 
gcgccaccca 
ggggcggcgt 
cgcctccgcc 
ccaggagccg 
gagcccggag 
ggaggcctcg 
ctgctggagg 
gtggcctctc 
cccctccacc 
actccacacc 
cca 



ggtgggccgc 
cccgggccag 
ccacccccca 
gtcccgcgaa 
ctcctcctgt 
agccgcccag 
caccggggac 
gggagagact 
gaggaacgga 
gacggacggg 
accgcccgcc 
cgccgcgccg 
cggtttcctc 
ttcacagacc 
gccacggagg 
ccctcccgaa 
actgtttgct 
catggtgcct 
ttgccgcacg 
gtctcgctct 
cgtcccggca 
actcaagact 
tccctgctgc 
agaggtccgg 
tcctcttcgt 
cccccccacc 
gtcccttccg 
gagcctgcct 
aggcctcctg 
ggcctctctc 
cccgacaccc 
actcgtttgg 
cccgggcatc 
ccagatttgg 
gccctggccc 
atcccagacc 
ccgggaggag 
gaatcgaagg 
gtgcagcgcg 
cggcgcgtgg 
acagggggct 
cgcgccggca 
cccggctcct 
ccgggcaaaa 
gtggcacagc 
cgtcccaggg 
gggaacccca 
tccgcgcggc 
gcgccctggt 
tttctgcagc 
gaagaggcgc 
agctttagga 
tttcgcgggg 
gggctgaccg 
gcggagaact 



ctactgcgca 
agctctcctg 
ccccccaccc 
acaccgggcc 
cgcccccggg 
ctgccagcac 
gctgagcgtt 
agaggggcgg 
gggagagaca 
gggaggaagg 
tccgggaaaa 
gccggcgggg 
cgggacaaaa 
gcacatcccc 
agccacacgc 
ggcggcctgt 
cccggagctc 
tcgctctcct 
cgggaatcca 
ggtcttctac 
acggcgacgc 
cccacggagg 
cgacgcgtgg 
agagccggcc 
ctctccggcc 
accaccacca 
gggtggggcg 
gcctgtgggc 
gctgcacctg 
tgcccgcgtc 
tcggacagca 
accccgagcc 
gccaccagag 
tttcagaatg 
gggagacgcg 
gccctgctcc 
ctggccagag 
gccaggcacc 
gcccccggcg 
ggaacggggc 
ttcgtgagcc 
gagggggtct 
ccggagccgg 
gccgggagga 
ctgggcccgc 
gagtccgtgg 
agccggggca 
aggggcagat 
ctgcactccc 
aggcgcaacc 
ctcgctggaa 
cgcggggttg 
aacacctggc 
gcctgggatt 
gccattcttt 



cgcgcgggtt 
cctctccacc 
ccggaaaacg 
ccgcgcagcg 
ccaccgtcgc 
gggcggctgg 
ccaggcggga 
aagggacgtt 
gagcgacgca 
cagggaggaa 
cggtcagcgt 
caccacccat 
gaccgggact 
aggctgagcc 
aggacgacgg 
tgctcacgtc 
tgcggcaccc 
tgccaggttc 
tcgtcaggcc 
gtggaaatga 
ccacaggcat 
ttcagttcca 
gagcccagag 
cccgaaacgc 
ccaccaccac 
ccaccacccc 
ggctgtccca 
ctttacaagg 
ccgcagtgca 
cgtccgtgaa 
ccctccccgc 
aaagcgaggc 
aacggctggc 
agaggtcacg 
gcccgccaga 
tccgagcctt 
agacgggcct 
cgggacaggg 
ggggtcaccc 
ttcccgcacc 
aggcagcgag 
cccaacctgc 
ggcgctctcc 
ccgggacccg 
tcaagcgggg 
tggggctggg 
gctccacctc 
gcaaggcatc 
ctgcggcctg 
tctcctagaa 
gcacccctca 
ggacggggtc 
tggctacgga 
cctgccttct 
cctgggcatc 



tgcgggcagc 
agcccacccc 
cgtcgtcccc 
tccgggcctg 
ccgcccgccc 
cggcggaacg 
gggaaggcgg 
aggagggagg 
gggactgggg 
aagcggtcct 
ccggcgcggg 
tcgccccggt 
cgggttgccg 
ctgcaacgcg 
aggcgtgatt 
tctccggccc 
ggaaacatgc 
caaacccgcc 
atcacgccgg 
acgagagcca 
tgcctccttc 
cactcccctc 
agcggcttcc 
gcccccctcc 
caccgccacc 
gccggccggc 
ggggggctca 
gcggctggct 
cagtccggct 
attccggccg 
ggaagcccgg 
cctgcgagcc 
ccaggccatc 
ccagctgagg 
aggccggcga 
tgagaaggat 
cccggagtcc 
tggcagggcg 
tgctccctcg 
ccacgtgccc 
ggccgccccc 
cccggcgcgc 
caccctcagg 
cagcgcgacg 
ccgcaggcca 
gccggggtcc 
cccagcccgc 
ccggcgccct 
ctgctggatg 
acggaggccc 
gcgaggaaga 
gggtggttcg 
ggggcgtgtc 
aggtccaggc 
ccggggatcc 



cgcctgggct 
gccgcctgac 
tgggctgggt 
acaccgctcc 
gggcccctgc 
cagaccccag 
gcagagatgg 
cagggaggca 
gcggggccga 
cggcctccgg 
ctgagggctg 
tccggggccc 
tcgggtcttc 
gcgcgaggcc 
ttggtttccg 
ccgaaaggct 
agggaagggt 
acactgcaga 
ggaggcatct 
cacgcctgcg 
acggagagag 
caccctccca 
cgttcccgcg 
cccctccccc 
acgcctcccc 
cccaggcctc 
ccgccattca 
ggctggctgg 
gaggtgcacg 
gggctcaccg 
ggacgaggac 
tgctttgagc 
ggcattccgg 
cagcaccggc 
aagcggaccg 
cgctttccag 
aggattcaga 
cccgcgcagg 
tgggtcgcct 
tgcgcgcctg 
gcgctgcagc 
ggggatttcg 
ctcctcggtg 
gcctgccggg 
aggggtgctt 
ccaggtcgcc 
gcccccggga 
cccaggcgct 
agctcctggc 
cgggggagct 
ataccgggct 
gggcagggcg 
tccgccccgc 
ccggtgagag 
cagagccggc 
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>f T ref|NT 017795 . 18 1 HslQ 17951 13 Homo sapiens chromosome 10 genomic contig, referer 
Length=17 97220 

Sort alignments for this su 
E value Score Percent i 
Query start position Sub 

Features flanking this part of subject sequence: 
226 bp at 5' side: similar to double homeobox 4c 
1277 bp at 3 1 side: similar to double homeobox 4c 



Score = 739 bits (384), Expect =0.0 
Identities = 513/555 (92%), Gaps = 35/555 (6%) 
Strand=Plus/Plus 



Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 



1751312 
61 

1751372 
121 

1751431 
181 

1751490 
241 

1751550 
301 

1751608 
360 

1751658 
420 

1751718 
462 

1751777 
521 

1751837 



ggtaccagcaggtqgqccgcctactgcqcacqcgcggqtttgcgggcaqccgcctqggct 
I I I I I I I I I I I I I I II I I I I I I I I I I II I I II I I I I II I I I I I I I I I II I I I I I I II I I I 
GGTACCAGCAGGTGGGCCGCCTACTGCGCACGCGCGGGTTTGCGGGCAGCCGCCTGGGCT 

gtgggagcagcccgggccagagctctcctgcctctccaccagcccaccccgccgcctgac 
I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GTGGGAGCAGCCCGGGC-AGAGCTCTCCTGCCTCTCCACCAGCCCACCCCGCCGCCTGAC 

cqccccctccccaccccccaccccccacccccggaaaacgcgtcgtcccctgggctgggt 



I I I I I I II I I I II I I I I I 

CGCCCCCTCCCCACCCCC- 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

ACCCCCCGCCCCCGGAAAACGCGTCGTCCCCTGGGCTGGGT 



g g a g a c c c c c g t c c c q c q a a a c a c c q q g ccccgcgcaqcgt c cgggcctqacaccgct c c 

1 1 1 1 1 1 1 1 1 1 1 1 1 ii i i i 1 1 1 1 1 1 ii i 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

GGAGACCCCCGTCCCGCGAAACACCGGGCCCCGCGCAGCGTCCGGGCCTGACACCGCTCC 

gccggctcgcctcctcctgtcgcccccgggccaccgtcgcccgcccgcccgggcccctgc 
i I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GGCGGCTCGCCTCCTC-TG-CGCCCCCGCGCCACCGTCGCCCGCCCGCCCGGGCCCCTGC 

g g g c c c ctgcagccgcccagctgccagcacgg- g c g g c t g g c g g c gg a a c g ca gac cCC A 
I I It I I I I I I I I I I I I I I I I I 11111111111111111111111111 
AGCCGCCCAGGTGCCAGCACGGAGCGCCTGGCGGCGGAACGCAGACCCCA 

GGCCCGGCGCACACCGGGGACGCTGAGCGTTCCAGGCGGGAGGGAAGGCGGGCAGAGATG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GGCCCGGCGCACACCGGGGACGCTGAGCGTTCCAGGCGGGAGGGAAGGCGGGCAGAGATG 

GAGAGAGGAACGGGAGAGACTAGAGGGGCGGA AGGGACGTTA 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1111111111 
GAGAGAGGAACGGGAGA-CCTAGAGGGGCGGAAGGATGGGCGGAGGGACGTTAGGAGGGA 

-GGAGGGAGGCAGGGAGGCAGGGAGGCAGGGAGGAACGGAGGGAGAGACAGAGCGACGCA 

I I I I I I I I I I I I I I I I I I I I I I -I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GGGAGGGAGGCAGGGAGGCAGGGAGGCAGGGAGGAACGGAGGGAAAGACAGAGCGACGCA 

GGGACTGGGGGCGGG 535 " 
I I I I I I I I I I I I I I I 
GGGACTGGGGGCGGG 1751851 



60 

175137 
120 

175143 
180 

175148 
240 

175154 
300 

175160 
359 

175165 
419 

175171 
461 

175177 
520 

175183 
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